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Abstract 

We study the spread of stochastic SIR (Susceptible —>■ Infectious —>■ 
Recovered) epidemics in two types of structured populations, both con¬ 
sisting of schools and households. In each of the types, every individual 
is part of one school and one household. In the independent partition 
model, the partitions of the population into schools and households are 
independent of each other. This model corresponds to the well-studied 
household-workplace model. In the hierarchical model which we intro¬ 
duce here, members of the same household are also members of the 
same school. 

We introduce computable branching process approximations for 
both types of populations and use these to compare the probabili¬ 
ties of a large outbreak. The branching process approximation in the 
hierarchical model is novel and of independent interest. We prove by 
a coupling argument that if all households and schools have the same 
size, an epidemic spreads easier (in the sense that the number of indi¬ 
viduals infected is stochastically larger) in the independent partition 
model. We also show by example that this result does not necessarily 
hold if households and/or schools do not all have the same size. 


1 Introduction and context 


Mathematical modeling of the spread of infectious diseases has a long his¬ 
tory [Diekmann et ah 2012 . A commonly used model for epidemics, the 


SIR epidemic in a closed population, is easy to describe, but this model 
already has interesting features. In stochastic models for epidemics in large, 
unstructured homogeneously mixing populations (that is, every pair of indi¬ 
viduals makes contacts at the same rate) branching process approximations 
can be used to compute the probability that an epidemic will occur if a 
disease is introduced in a population. In case of an epidemic, we can also 
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use branching process approximations to compute the expected fraction of 
the popnlation that is infected throughout the course of the epidemic. 

The homogeneous mixing assumption is too strong. One way to gain 
realism is to assume multiple levels of mixing. A popular extension is the 
so-called honsehold model Ball et al., 1997 . In this model the popnla¬ 


tion is partitioned into honseholds of relatively small size. Within house¬ 
holds, contacts are more frequent than in the general population. Inspired 
by the modeling of the spread of childhood diseases, this model has been 
extended further by also introducing an independent partition of the popu¬ 
lation into schools (or workplaces) [Ball and Neal 2002 


Pellis et al., 2009 


2012 . For these diseases, the spread in schools plays an important role and 


the household-school model is a natural model in this case. 

The assumption that the partition in households and schools are inde¬ 
pendent has as a consequence that in large populations it is unlikely that 
members from the same family attend the same school. However, it would 
be more realistic to assume that siblings do go to the same school. This idea 
leads to the hierarchical model which we define below. 

Mathematically speaking, the independent partition model is the easier 
one to understand. It would be of great theoretical and practical interest if 
we could show that epidemics spread more easily (the precise meaning of this 
is explained in the following sections) in the independent partition model 
than in the more realistic hierarchical model. Indeed, control strategies 
which are known to work in the independent model, will then also stop 
epidemics in the hierarchical model. 

We show in Theorem 12.31 that in case the sizes of schools are all the same 
and the sizes of households are all the same, then indeed an epidemic spreads 
easier in the independent partition model. We use branching process approx¬ 
imations for this conclusion. However, if the sizes of schools and households 
are variable, then, perhaps surprisingly, this is not true in general, see The- 
The branching process approxiamtion in the hierarchical model is 


orem 


2.4 


new and interesting in its own right. 


2 The model and main results 

2.1 Social structure 

We consider two models for a popnlation structure. In either model, indi¬ 
viduals are part of exactly one household and exactly one school. In the 
independent partition model, the partitions of the population into house¬ 
holds and into schools are independent. This model has been stndied as the 
household-workplace model before in e.g. [Ball and Neal[ [2002 


Pellis et al. 


2009, 2012 . In the hierarchical model, members of the same household at¬ 


tend the same school. 
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We can formally construct populations of either type with a given num¬ 
ber of n schools as follows. In the hierarchical model every school contains 
individuals from an independent and identically distributed (i.i.d.) num¬ 
ber of households, where this number of households the individuals in a 
school are part of is distributed as W- Households have i.i.d. sizes, dis¬ 
tributed as Nfi. Hence the number of individuals in a school is distributed 
as Ng ~ Y2k=i where Nj^\ k = 1,2,... are independent copies of N^, 
which are also independent of Nc- For mathematical convenience we assume 
that both Nfi and W have bounded support on the positive integers. We 
denote by = iV(n) the total number of individuals in the population, that 
is, N = where ivi^^ denotes the number of individuals in the 

k-th. school. 

In the independent partition model we use the school and household 
sizes from the hierarchical model and use the independent partitions of the 
N individuals in the population, uniformly chosen among all partitions with 
respectively the required household sizes and required school sizes. 


2.2 SIR epidemics 


We consider a stochastic SIR epidemic in a closed population. In this model, 
individuals are in one of the three states, S, I and R. If a susceptible indi¬ 
vidual contacts (note Remark |2.1| below) an infectious individual, the sus¬ 
ceptible one becomes infectious immediately (she is infected) and stays so 


for exactly one time unit (but note Remark 2.2 below). After this infec¬ 
tious period the individual recovers and stays immune forever. There are 
three types of contacts: pairs of individuals which are in the same household 
make household-contacts according to Poisson processes with intensity \h- 
Similarly, pairs of individuals within the same school make school-contacts 
according to Poisson processes with intensity A^. Finally, all pairs of individ¬ 
uals in the population make global-contacts according to Poisson processes 
with intensity \g/{N — 1). All Poisson processes are independent of each 
other. Individuals which are in the same household and in the same school 
have a total contact intensity of A/i -|- -|- Xg/{N — 1). We assume that 

initially there is a randomly chosen individual with a remaining infectious 
period of 1 time unit and all other individuals are susceptible. We assume 
that the spread of the epidemic - if it spreads - is so fast, that “demographic” 
processes such as births, individuals reaching the school-going age or moving 
to another school during the epidemic can be ignored. 

A household epidemic is defined as an outbreak which occurs if all global 
and school contacts are ignored; a household epidemic is always restricted 
to one household. Similarly a school epidemic is an outbreak which occurs 
if all global and household contacts are ignored. 

In this paper we are mainly interested in the final size of an epidemic, 
that is, the fraction p of individuals which are infected throughout the epi- 
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demic. We use (and state the arguments for this) that in the large population 
limit, if the fraction of infected individuals is positive (i.e. a major outbreak 
occurs) than with probability tending to 1 (as the population size grows) 
this fraction is equal to the probability of a major outbreak. 


Remark 2.1 Contacts as defined above, are not necessarily identical to 
physical contacts. Only encounters which lead to the transmission of the 
disease if one of the individuals is infectious and the other susceptible are 
considered to be contacts. So, if only half of the contacts of an infectious 
individual with a susceptible leads to transmission, then we can model the 
“infectious contacts” by thinning all original Poisson processes representing 
physical encounters. The remaining points are still distributed according to 
a Poisson process, now with half the density of the original process. 


Remark 2.2 The assumptions that (1) the infectious period is non-random, 
(2) the infectious period starts immediately at infection, and (3) that the 
contacts are described by homogeneous Poisson processes, are too strong. 
They might be replaced by the assumption that for every (ordered) pair of 
individuals the event that the first individual, if infectious, contacts the sec¬ 
ond individual is independent of contacts between other pairs of individuals. 
In particular, the methods and results of this manuscript apply to models 
in which the infectious periods of individuals are not random. This inclused 
SEIR epidemic models with non-random infectious period, in which there 
is a random exposed (latent) period between the moment an individual is 
infected and the moment that it starts to be infectious. See 
1982, Meester and Trapman, 2011 for a discussion. 


Kuulasmaa 


2.3 Results 


Our results are twofold. In the first place we introduce certain branching 
processes (sometimes multi-type) which enable us to carefully describe the 
initial phase of an epidemic. As far as we are aware, our methods for com¬ 
puting the quantities of interest in the hierarchical model, in particular the 
approximating branching process, are new. The various branching processes 
used for the two models are somewhat hard to compare directly since the 
units of the various branching processes are not the same. In order to use the 
branching processes and make actual computations, we need to know how to 
make exact computations for epidemics restricted to households or schools, 
which are relatively small compared to the total population. This part is 
carried out in Section 3, while the actual branching process approximations 
are described in Section 4. Our strategy for the independent partition model 
is similar to the computations suggested by Ball and Neal [2002 and our 
results are in agreement with theirs. We note that Ball and Neal [2002 do 
allow for random infectious period and their model is in that sense more 
general than ours. 
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In the second place we are interested in direct comparison of the inde¬ 
pendent and the hierarchical model. We prove the following theorem. 

Theorem 2.3 Consider a hierarchical and independent partition model in 
which households and schools have non-random sizes. Let n he the number 
of schools and let Z^{n) and Z\n) denote the number of ultimately recov¬ 
ered individuals in the hierarchical model and independent partition model, 
respectively. Then, for any fixed k, we have 

liminf \F{Z^(n) < k) — F{Z\n) < A:)| >0. (1) 

Hence for fixed household and school sizes, the epidemic spreads easier in the 
independent model. Note that the theorem implies that the probability of a 
large outbreak in the hierarchical model is bounded above by the probability 
of a large outbreak in the independent partition model. 

The assumption that all households and all schools have non-random 
sizes cannot be deleted in general. This is shown in the following theorem. 

Theorem 2.4 If we allow for variation in the sizes of the household and 
school in the population, then Q does not hold in general. In particular, 
we have the following two counterexamples. 

Let pfi := I — e~^^ and ps := 1 — . In either of the following two 

situations, 0 does not hold for j large enough: 

1. For some fixed (and large) j, households have size j with probability 
{2j)~^, and size 1 with probability 1 — {2j)~^. Furthermore, Nc = 1, 
that is, in the hierarchical model schools contain exactly one house¬ 
hold. Furthermore, Ps = Ph = 2(3j)“^ and Xg = 1/10. 

2. All households have size 2, so = 2. For some fixed (and large) j, 
XIc = 3 with probability (4j)“^ and W = 1 with probability 1 — (4j)“^. 
Furthermore, Xg = 1/20, ph = I and ps = (3j)“^. 

Of course one example would suffice to show that 0 does not hold in 
general, but the reason why Q does not hold is somewhat different in the 
two examples, and therefore we present them both. 

We prove these theorems in Section 

Remark 2.5 The examples in Theorem |2.4| are obviously extreme and cho¬ 
sen such that we can exploit dependencies which do not appear if all house¬ 
holds and schools have the same size. In the hrst example there are huge 
differences between the sizes of the households. Individuals which are part of 
a large household are automatically part of a large school in the hierarchical 
model, while the sizes of the school and the household of an individual are 
independent in the independent partition model. The dependence can be 
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used to increase the offspring mean of the branching process approximating 
the epidemic in the hierarchical model, defined in Section]^ In particular, 
the branching process can become supercritical in the hierarchical model, 
while without the dependencies it would be sub-critical. 

In the second example, all households have the same size and are rel¬ 
atively small, but the variance in school sizes is large. Here we use that 
household members of individuals in a large school are also part of a large 
school in the hierarchical model, while this is not automatically the case in 
the independent partition model. In particular, the parameters of the model 
are chosen in such a way that if we ignore all global contacts, expected epi¬ 
demic sizes will be larger in the hierarchical model than in the independent 
partition model. 

Less extreme examples could be used, but computations would be messy 
and the examples would rely on the same principles. 


3 Random graphs and local epidemics 


It is useful to describe the collection of ultimately recovered individuals by 
means of a random graph in which vertices represent individuals. This is a 
classical approach, see e.g. [Cox and Durrett , 1988 . The graph is built up as 
follows. For every vertex we draw, with probability ph = 1 — e~^^, directed 
“household edges” to each of the other vertices corresponding to individuals 
in its household. Similarly we draw, with probability ps = 1 — e~^‘’ directed 
“school edges” to each of the other vertices representing individuals in its 
school. Finally we draw, with probability pg = 1 — directed 

“global edges” to each of the other vertices in the population. All edges are 
drawn independently of each other. 

The endpoints of respectively household, school or global edges starting 
at a given vertex correspond to the individuals that will be contacted by 
the individual represented by this given vertex via respectively household, 
school or global contacts during its infectious period, were it to be infected 
in the course of the epidemic. The set of individuals infected in the course 
of the epidemic started at a randomly chosen individual is distributed as the 
set of vertices that can be reached by a directed path starting at the vertex 
representing such a randomly chosen individual. 

It is well known [Cox and Durrett 


1988 that the set of vertices which 


can be reached from the vertex representing the initially infected individ¬ 
ual has the same distribution as the cluster of this vertex in an undirected 
graph in which undirected (household) edges are drawn independently be¬ 
tween (pairs of) vertices representing members of the same household with 
probability ph, undirected (school) edges are drawn independently between 
vertices representing members of the same school with probability ps and 
finally, all pairs of vertices share a (global) edges with probability pg. In this 
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undirected graph, the epidemic generating graph, there is no time evolution. 
The school cluster of a vertex is the cluster of the vertex in the epidemic 
generating graph if all global and household edges are ignored. This clus¬ 
ter corresponds to the individuals infected through a school epidemic if the 
individual corresponding to the index vertex gets infected. The household 
cluster of a vertex is the cluster of the vertex in the epidemic generating 
graph if all global and school edges are ignored. The household cluster has 
a similar interpretation as the school cluster. 

We state without proof that for both the hierarchical and the indepen¬ 
dent model, the epidemic generating graph has, in the large population limit 
with probability tending to 1, at most one cluster of the same order of size 
as the population. The fraction of the number of vertices in this cluster 
converges in probability to a constant as the number of schools goes to in¬ 
finity. The proof of this fact runs along the lines of the proofs of similar 
statements for Erdos-Renyi graphs [Durrett 2006| (cf. [Ball et ah 2009] ). It 
implies that if the initially infected individual is chosen uniformly at random 
from the population, then the probability of a large outbreak is the same 
as the fraction of individuals which are ultimately removed. Indeed, if we 
first construct the epidemic generating graph and then choose the vertex 
representing the initial infectious individual uniformly at random, then it 
is straightforward to see that the probability of a large outbreak and the 
fraction of the individuals which are infected during such an outbreak are 
both equal to the fraction of the vertices in the large cluster. 

Next we present exact computations in small populations. Since we 
apply this to either household or school epidemics, we consider the situation 
in which there is only one type of contact, that is, the epidemic generating 
graph is built up in such a way that undirected edges between any pair of 
vertices exist with probability p, independently of all other pairs. The result 
is a special case of Diekmann et ah, 2012, p.281]. 


Theorem 3.1 ( [Diekmann et al. , 2012j ) Consider a standard SIR epi¬ 
demic in a homogeneously mixing population in which initially 1 individual is 
infectious and L individuals are susceptible. Let, forO < k < L, = P^{p) 
denote the probability that k members out of the initial population of L sus- 
ceptibles are ultimately recovered. Then we have for all 0 < i < L, 




k=0 


£-k 


( 2 ) 


Applying this result for £ = 0,1,..., L consecutively enables us to effi¬ 
ciently compute Pq, P[, ..., P^. 

This theorem has a multi-type variant [Ball 1986 
in the analysis of the hierarchical model. 


which we will use 
Let A: be a positive integer. For 


i := {ii,... ,£k) and L := (Li,... ,Lfc), £ < L is dehned to mean £{ < Li for 
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all 1 < i < fc. Furthermore, we write 



and 


e. lx 4 


u =0 Fii =0 ^^^=0 


Let -pij be the probability that a given type-i individual contacts a given 
type-j individual during its infectious period in case the type-i individual 
is infected. We assume that initially there is one infective individual, which 
without loss of generality can be chosen to have type 1. 


Theorem 3.2 ( [Ball , 1986| ) Consider a population subdivided into k dif¬ 
ferent types of sizes L = (Li + 1, L 2 ,..., L^), where 1 individual of type 
1 is initially infectious and all other individuals are initially susceptible. 
Let Pu be the probability that the vector of numbers of ultimately recovered 
individuals (not including the initial infective) in the epidemic is equal to 
u = (ui,... ,Uk). Then for each 0 < £ < L rce have 


E 

u =0 


L — u 

I — u 


k / k 

i=i 




Li — 


*=1 \j=l 


(3) 


4 Branching process approximations 


For SIR epidemics in large homogeneously mixing populations, branching 
process approximations are reasonable since it is unlikely that during the 
early stages of the epidemic contacts of infectious individuals are made with 
non-susceptibles (see e.g. Diekmann et al., 2012 or use birthday problem 


arguments). However, within schools or households, the epidemics take 
place in small groups, and a standard branching process approximation is 
no longer viable. In this section, we explain that despite the existence of local 
epidemics inside schools or households, a branching process approximation 
can still be carried out. We carry out the approximations separately for the 
two types of models. We reserve the word “child” to refer to the offspring of 
a particle in the branching processes, not to a child in the actual population. 


4.1 The independent partition model 

In this subsection we show that we can carry out a branching process ap¬ 
proximating in the independent partition model, where the units are in¬ 
dividuals of the population and which will have three types. The three 
types of particles in the branching process correspond to individuals in¬ 
fected through either global, school or household contacts respectively. The 
number of global children in the branching process of each particle is Pois¬ 
son distributed with mean pLc- Particles corresponding to individuals not 
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infected through a school contact have a number of “school-children” dis¬ 
tributed as the size of a school epidemic of a randomly chosen individual 
in the population. Similarly, particles corresponding to individuals not in¬ 
fected through a household contact have a number of “household-children” 
distributed as the size of a household epidemic of a randomly chosen indi¬ 
vidual in the population. Below follows a more detailed approach. 

Imagine that we know the number and sizes of the schools and the house¬ 
holds in the partition, but that we have not assigned individuals to the 
partitions yet. Let n be the number of schools and n' be the number of 
households, and number the schools from 1 to n and the households from 1 
to n'. 

If we choose an individual uniformly at random from the population, 
then the probability that we choose an individual from a household of size 
k is given by the size biased distribution kF{Nh = k)/K{Nfi). Indeed, it is 
k times more likely to be in a given household of size k than it is to be in 
a given household of size 1. A size biased variant of a random variable is 
decorated with a tilde, so F{Nh = k) := kF{Nh = k)/F{Nh). Similarly the 
school size of a uniformly at random chosen individual is distributed as Ng 
and the number of households within the school of a uniformly at random 
chosen individual is distributed as Nc- 

We now create an i.i.d. sequence of randomly chosen schools by picking 
schools with replacement according to a size biased distribution. (There is a 
technical detail here which we ignore: in order to draw with replacement and 
to perform couplings below, we actually need to use the empirical distribu¬ 
tions for household and school sizes, i.e. the distribution determined by the 
actual (finite) sequence of household and school sizes. A rigorous treatment 
of this detail can be found in [Ball et ah 2009| .) Say that this sequence is 
x(l),x(2),.... Let Tg be the first repeated index in this sequence, that is, 
Tg = min{j : x{j) = x{i) for some i < j}. Similarly create an infinite i.i.d. 
sequence of households drawn (with replacement) according to a size biased 
distribution. We denote this sequence by x'(l),x'(2),... and let be the 
first repeated index in the sequence of households. Since the school and 
household sizes have bounded support, a birthday type argument gives that 
for c G (0,1/2), we have F{Tg < n'^) —> 0 and F{Th < rf) —> 0 as n —> oo. 

We now describe the coupled construction of the branching process and 
the epidemic generating graph. In the cluster of the initial infective in 
the epidemic generating graph and the approximating branching process we 
distinguish between three types of vertices: “global”, “school” and “house¬ 
hold”, where the type of a vertex is the type of the edge through which 
the vertex enters the process. The ancestor of the branching process (and 
the uniformly at random chosen vertex used to start the exploration of the 
epidemic generating graph) receives the type “global”. 

We associate to the ancestor in the branching process school x(l) and 
household x'{\). Assume that school x(l) has size I, then the number of 
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children of the ancestor with type “school” is equal to k with probability 
Pjr^{ps). (Here, the —1 in / — 1, conies from the fact that in this school 
epidemic there are I — 1 initial susceptibles). If the household x'(l) has 
size k then the number of children of the ancestor with type “household” is 
equal to k' with probability and this number is independent of the 

number of “school” children. Finally, the ancestor also has a Poisson number 
of “global” children with expectation Xg. This number is independent of the 
number of “household” and “school” children. 

Important to note is that, in order to be able to use branching process 
approximations, we treat all vertices in a school/household cluster of the 
ancestor in the epidemic generating graph as its children, while in reality 
the ancestor and child might not share an edge, but only have a path of 
school/household edges between them. We need this technique of assigning 
vertices to a generation to keep branching process approximations mean¬ 
ingful in the sense that we retain enough independence (cf. [Pellis et al. 

To the “global” children we assign both households and schools: if the 
number of “global” children is k then we assign schools x{2) up to and in¬ 
cluding x{k + 1) and households x'{2) up to and including x'{k' + 1) to those 
children. Furthermore, we assign households to the “school” children (the 
following households in the sequence) and schools to the “household” chil¬ 
dren (the following schools in the sequence). As long as the total number of 
assigned schools is less than Tg and the total number of assigned households 
is less than T^, we can proceed with the construction in the obvious way. In 
this construction we create part of the epidemic generating graph through 
a branching process. 

If the coupling proceeds then we assign vertices to school x(l) and house¬ 
hold a:'(l) in such a way that the household and school overlap at only one 
vertex, say vertex v, which is the vertex corresponding to the ancestor in the 
branching process. If in the branching process the ancestor has k “school” 
children, then the size of the cluster of v created by school edges is A;-|-1 (the 
-|-1 is because v is also part of the cluster). In the same way we choose the 
cluster of v created by household contacts in x'(l). If the number of “house¬ 
hold” children of the ancestor is k', then this cluster has size P + 1. Finally, 
global edges are drawn to vertices which are part of households and schools 
which (i) only overlap with each other at the chosen vertices itself, and (ii) 
do not overlap with the households and schools of vertices in households and 
schools already explored. 

The next step is to assign individuals (and the schools and households 
they belong to) to the generation 1 vertices. This happens in exactly the 
same way as individuals were assigned to the ancestor, apart from the fact 
that “school” individuals do not have “school” children, since their school 
is already explored, and “household” individuals do not have “household” 
children. As long as the number of schools assigned to individuals does not 
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exceed Tg and the number of households assigned to individuals does not 
exceed Th, the construction proceeds. 

Since the probability that Tg or is less than for c G (0,1/2) goes 
to 0 as re —)> oo, the branching process approximation works with large 
probability for all small clusters (i.e. clusters of smaller order than re^/^) in 
the epidemic generating graph. If the vertex representing the initial infective 
individual is in such a cluster then the approximating branching process 
goes extinct. If the initial infective individual is part of a large cluster, i.e. a 
cluster which asymptotically contains a positive fraction of the graph, then 
standard arguments used in random graph theory (e.g [Durrett 2006, Ch. 
3]) show that the approximating branching process survives. 

The next step is to compute the probability that the constructed branch¬ 
ing process dies out. Every individual has a Poisson number of “global” 
children with mean Xg. Every “global” and “school” individual has a ran¬ 
dom number of “household” children, and we denote the probability that 
this number is equal to k by 


zh{k) ■.= Y,nNh = i + i)ptiPh), 

l=k 

where Pj-iph) is defined via Q. Similarly every “global” and “household” 
individual has a random number of “school” children. We denote the prob¬ 
ability that this number is equal to k by 

OO 

zs{k) ■.= Y,¥iNs = l + l)Piips). 
l=k 


“School” individuals do not have “school” children and “household” indi¬ 
viduals do not have “household” children, but apart from that the number 
of children of the different types are independent. 

It is well known [Jagers 1975, Ch. 4] how to compute extinction proba¬ 
bilities for multi-type branching processes. Define the probability generating 
functions of the offspring distribution as follows. Eor 0 < tg, < I, 


OO OO OO / \ \lc 

UhMk) ■■= Y.Y.Y. 

kg={)ks=0kh=^ 

OO OO 

ks=0 fc(,=0 

and similarly, 

OO 

fs{tg,th) ■= ^ Zh{kh){th)’"'^ (0 < < I), 

kh=0 

OO 

fh{tg,ts) ■■= ^ e~^o^^~^o''Zs{ks)its)^’’ (0 < tg,ts < I). 

ks=0 


II 

















Recall that we assume that the initial infective individual is a global indi¬ 
vidual. The probability of extinction of the branching process is equal to tg, 
where {tg,ts,th) is the smallest positive real solution of the following set of 
equations: 


ts — fs{tg,tfi)., 
th ~ fh{tgits)- 


It is equally well known |Jagers[ 1975, Ch. 4] how to quickly decide 


whether or not the probability that the approximating branching process 
survives is positive. Let mg = YlT=o^^sik) be the expected number of 
individuals infected in a school epidemic (excluding the initially infected 
individual in the school) and m/j = YlT=o kzh{k) be the expected number of 
individuals infected in a household epidemic (exuding the initially infected 
individual in the household). Define the so called next generation matrix of 
the branching process by 


/ \g nis nih \ 

M = I Xg 0 ruh \ . (4) 

\ Xg rus 0 / 

The probability of extinction of the branching process is strictly less than 1 if 
and only if the largest eigenvalue of M (which is positive and real) is strictly 
larger than 1. A small computation shows that this is the case exactly when 


Xg{ms + l){mh + l) > l-rngnih = {ms + l) + {mh + l)-{ms + l){mh + l). (5) 


These results corresponds to the results in [Ball and Neal , 2002| . This largest 
eigenvalue of M is often referred to as the basic reproduction number, Rq 
[Diekmann et ah 2012, Pellis et ah, 2012| . In Section]^ we will give some 
examples where is used. 


4.2 The hierarchical model 

In the hierarchical model the spread within its household and within its 
school caused by an individual are no longer independent, since households 
are entirely contained in schools. We get around this problem by chang¬ 
ing the unit of the branching process. In particular, the particles in the 
branching process no longer correspond to individuals, but to clusters of in¬ 
dividuals in the same household. Moreover, the type of a particle is given by 
the number of individuals in its corresponding cluster. Below we derive the 
offspring distribution for this branching process. However, note that there 
are no easy closed expressions available for describing this distribution. 

Consider a household of size k, say. The epidemic generating graph 
restricted to this household and restricted to household edges, partitions the 
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household into clusters. The joint distribution of the sizes of the clusters in 
the partition can be obtained via Q. Indeed, the size of the first cluster 
(in order of exploration) is li with probability Conditional on the 

size of the first cluster being li, then the size of the second cluster is I 2 with 
probability , and so on. In this way, every household is partitioned 

into clusters with the property that if one of the individuals in the cluster 
gets infected, then all the vertices in that cluster get infected. Further 
infections within that household have to go through either school and global 
contacts, or through individuals outside the household. 

Instead of considering a school as partitioned into households, we view 
a school as partitioned into clusters generated by household edges. The 
sizes of those clusters are not independent. The joint distribution of these 
cluster sizes is difficult to describe explicitly, but, as described in the previous 
paragraph, computationally relatively easy to deal with. 

To compute the final size of an epidemic restricted to school and house¬ 
hold contacts for a school with a given configuration of households in it, we 
first assign types to the clusters generated by household contacts, where the 
type is the number of vertices within the cluster. Those clusters are the 
“super-individuals” and we apply Q to compute the final size within the 
school, withpij = 1 —(1—Note that 1—Pij is the probability that there 
is no school contact between any of the individuals in the type i cluster and 
any of the individuals in the type j cluster. In order to compute the number 
of individuals infected in an epidemic restricted to a school, let Ui denote the 
number of clusters of size i which are ultimately infected through school con¬ 
tacts (now including the initially infected cluster within the school), which 
can be computed by using Q. The final size of the epidemic restricted to 
household and school contacts is then ~ 1- (The —1 originates from 

the fact that the initial infective individual within a school is not included 
in its final size.) Even for moderately large household and school sizes this 
sum has already many terms and we refrain from making the sum explicit. 

In the previous paragraph we assumed that the sizes of the households 
in a school are known. In order to perform further computations, we need to 
describe what the distribution is of such configurations for schools which are 
affected by the epidemic during the early stages of the epidemic. To do this 
we observe that if we choose a vertex uniformly at random then its household 
size is size biased and distributed as N^- The number of households in the 
school this vertex is part of is then distributed as Nc- All other households 
in this school have sizes distributed as N^. We can use this to find the 
distribution of the size of the cluster created by school and household edges 
which contains a uniformly at random chosen vertex from the population. 
Let Y have the same distribution as this random variable. Note that the 
uniformly chosen vertex is incorporated in Y. The distribution of Y is 
difficult to describe in closed form, but it is computationally tractable. 

Finally we can describe the branching process. Since school sizes have 
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bounded support, a giant component in the epidemic generating graph 
means that many schools are infected. This suggests that we can consider a 
branching process of initial cases in schools (and therefore a branching pro¬ 
cess of infected schools cf. [Ball and Neal[[T002] ). The (direct) offspring of a 
particle of the branching process consists of all vertices that can be reached 
by a path of school and household edges, apart from the final edge which is 
global and leads to a new infected school. So, the direct offspring of a par¬ 
ticle correspond to all vertices which can be reached by a global edge from 
one of the vertices corresponding to an epidemic restricted to household and 
school contacts from the individual corresponding to the particle. If this 
approximating branching process survives then the vertex corresponding to 
the initial infectious individual is in the giant component (with large prob¬ 
ability as n —>■ oo), while if this branching process goes extinct than the 
cluster of the vertex corresponding with the initial infectious individual is 
also small (compared to n). The number of children of an individual in the 
branching process is distributed as Z 
Poisson random variables with mean A, 
branching process is a single type branching process for which the extinction 
probability, q, is the smallest root of t = YlT=o ~ 


where the X^’s are i.i.d. 
g, which are independent of Y. This 


Jagers 


1975 


This smallest root is strictly less than 1 if and only if the offspring mean 
R* = E(Y)Ag > 1. We give some examples of how to use these computations 


in the proof of Theorem 2.4 below. 


5 Proof of Theorems 12.31 and 12.4 


Proof of Theorem We use a coupling of the epidemic processes on the 
hierarchical and the independent partition models similar to the one used for 
the branching process approximations in the previous section. This coupling 
is then used to show that {n) and Z^ {n) are asymptotically stochastically 
ordered as stated in Theorem 12.31 

Again we consider an i.i.d. sequence x(l), x(2),... of schools, by uni¬ 
formly picking schools with replacement. Let Tg = Ts{n) be the index of 
the first repeated school in this sequence. Similarly create an infinite i.i.d. 
sequence x'(l), x'(2),... of households drawn uniformly with replacement, 
and let Th = Th{n) be the index of the first repeated household in this 
sequence. Let Ak = Akin) = {Tg > A: -|- 1} n {T^ > A; -|- 1}. We have 

P(Z^(n) <k)< F{{Z^in) < k} n Ak) + T(.A^). 

By birthday-problem type arguments we have that for all <5 > 0, IP(Al|) < 6 
for sufficiently large n. Since F{{Z^{n) < A;} n Ak) < F{Z^{n) < k), it 
suffices to prove that 


P({Z^ (n) < A:} n Ak) < FiiZ^in) < k} D Ak), 


( 6 ) 
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or equivalently, 


F{Z\n) < k\Ak) < F{Z^{n) < k\Ak) 


(7) 


We simultaneously construct the epidemic generating graph of the hi¬ 
erarchical and of the independent partition model on a suitable probabil¬ 
ity space and show that for all k, and conditioned on Ak, the inclusion 
{Z^ < fc} C {Z^ < k} holds. 

Let x(l) be the school of the initially infected individual both for the 
hierarchical and the independent partition model. Let a school epidemic 
run in this school and use this epidemic for both the hierarchical and the 
independent partition model. Note that this gives the right distribution of 
the first school epidemic in both models. 

Now assume that the size of the school cluster in the corresponding 
epidemic generating graph is j, where j < k. In the independent partition 
model the households of the j individuals already infected are ... ,x'{j). 

Those households do not overlap since we condition on Ak. The size of 
the household epidemics (including the initial infected within the house¬ 
hold) are then i.i.d. and all distributed as the random var iable X, where 
P(X = i) = where is defined as in Theorem 


3.1 


In the hi¬ 
erarchical model, the households of the j individuals affected by the school 
epidemic do not need to be all different. The household epidemics are now 
run one by one, the initial infectives for those household epidemics are the 
j individuals affected by the school epidemic, we do however ignore the in¬ 
dividuals already infected before (by the school epidemic, or by household 
epidemics explored earlier). The probability that i individuals are ultimately 
infected through such a household epidemics is given by where fih is 

the number of individuals in the household not affected before by the epi¬ 
demic. Observing that hh < u/i — 1 gives that the number of vertices affected 
by such a local epidemic is always smaller than the same quantity in the in¬ 
dependent partition model. 

The next step is to investigate the school epidemics of the individuals 
infected through household contacts. Note that in the independent partition 
model, the number of susceptible schoolmates of the individual infected 
through a household contact is — 1 (by the conditioning on Ak), while 
in the hierarchical model this number is at most n* — 2. We proceed in 
this way analysing the epidemic through school and household contacts and 
we notice that the size of the cluster generated by school and household 
edges in the epidemic generating graph in the hierarchical model is bounded 
above by the same quantity as in the independent partition model. Let 
(respectively C/) denote this cluster in the hierarchical model (respectively 
independent partition model). If the number of vertices in such a cluster 
in the hierarchical or independent partition model is at most I, then the 
number of households and schools investigated is at most I since each vertex 
is part of exactly 1 school and 1 household. 
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The next step is to investigate the global edges from the vertices in 
and Cf, assume that Ci has size I' and C{ has size 1. Note that I' <1. We 
keep the two epidemic generating graphs coupled, by using a sequence of V 
i.i.d. Poisson numbers with mean \g and use these for the number of global 
contacts of the hrst I' vertices in C/ and for the I' vertices in In addition 
we independently assign i.i.d. Poisson numbers (with mean A^) to the other 
I — I' vertices in C/. Note that if I plus the total number of global edges 
from cl does not exceed k, then, because of the conditioning on Ak, the 
globally contacted vertices are all in different households and schools and 
also not in households and schools encountered before. In the hierarchical 
model we assign the same schools to the globally contacted vertices as in 
the independent partition model or a subset of those. Then we proceed with 
investigating the epidemic generating graph by investigating the new schools 
in the same way as we investigated the school of the initially infected vertex. 

By this coupling we obtain that if the exploration of the cluster stops 
before k+1 vertices are included in the independent partition model, then in 
the hierarchical model the cluster size is also less than k+1. Note that if we 
have explored the cluster in the independent partition model until we have 
included k vertices and there are still school, household or global edges not 
yet explored in the construction then conditioning on Ak guarantees that 
this k+l-st edge is to a not yet encountered vertex and so the cluster of the 
initially chosen vertex is at least k+1. This guarantees that conditioned on 
Ak, we have {Z^ < fc} C {Z^ < k} and completes the proof. 

Proof of Theorem \2.4\ We start the computations with Example I. In 
this example households have size j with probability (2j)“^, and size 1 
with probability 1 — {2j)~^. Furthermore, in the hierarchical model schools 
contain exactly one household. We set Ps = Ph = 2(3j)“^ and Xg = l/IO. 
We then choose j large enough to support our claim. Observe that the 
fraction of individuals within a household of size 1 is given by 

1 - (2j)-i _ 2j - 1 ^ 2 

Consider the independent model. Since j is large, we may approximate 
the epidemic within a household or school of size j by a sub-critical branching 
process with offspring mean 2/3. The total number of ultimately removed 
individuals in a school/household is then roughly = 3. In a 

school or household of size I, there are no secondary individuals. It is easy 
to check with Q that M has largest eigenvalue less than 1, and therefore 
that the epidemic in the independent partition model is sub-critical. 

In the hierarchical model, the probability of having an edge between two 
vertices in the same household is 1 — {1 — ps){l — Ph){^ — Pg) and since ps, Ph 
are small and pg is even much smaller than that (pg ~ X/N, where N is the 
total population size), we have that 1 — (1 —Ps)(l — ph){l — Pg) ~ Ps +Ph = 
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4(3j)“^. If j is large this leads to a supercritical Erdds-Renyi graph, if the 
epidemic generating graph is restricted to a combined household and school. 
In particular the largest cluster in a large school is of order j; say that it 
is with probability 1/2 at least aj, where a > 0. We claim that each large 
component is in expectation, via global edges, connected to Ag x aj x 1/3 x 
l/2xa other components of size at least aj. Indeed, approximately one third 
of those edges connects to other vertices in large schools/households, half of 
them contains a combined household/school cluster of at least size aj and a 
is the probability that this edge actually ends in the large household/school 
cluster. If j is large enough, the combined quantity is larger than 1 and 
there is a cluster of household/school clusters of size at least aj all connected 
through global edges which itself has size of order N. This shows that the 
epidemic generating graph has a cluster of the same order of magnitude as 
the population, which implies that Q does not hold. 

Next we consider Example 2. Recall that in this example all households 
have size 2. In the hierarchical model schools contain j households with 
probability (4j)“^ and only 1 household with probability 1 — (4j)“^. Fur¬ 
thermore, \g = 1/20, ph = 1 and ps = (3j)“^. Similarly to Case 1, we 
deduce that rus Ri 7/5 and mh = 2. This implies, using Q again, that the 
epidemic in the independent partition model is subcritical. 

In the hierarchical model, we see the households as “super-individuals” 
(since if one of the two household members gets infected the other will auto¬ 
matically get infected as well). So we can consider a social structure which 
only contains schools. Two (super) individuals (i.e. households) contact 
each other with probability 1 — (1 — (3j)“^)^ ~ 4(3j)“^. Again, the epi¬ 
demic generating graph for the school epidemic is an Erdds-Renyi graph, in 
which vertices on average share edges with 4/3 other vertices. Therefore, 
the Erdds-Renyi graph contains, with large probability, a cluster of order j 
and we can copy the argument from the previous example. 

6 Discussion 

We have discussed an ordering of epidemic severity of infectious diseases in 
two extreme population models. We stress that the comparisons are about 
the final size and the probability of a large outbreak, and not about the 
reproduction number Rq (the offspring mean of the approximating branching 
process). Indeed, since the units of the branching processes are so different 
we have different interpretations of Rq in the various models and it makes no 
immediate sense to compare this value for the two models directly. This is 
further illustrated in Figure[^where we plot Rq and the survival probability p 
against the infection rates. In this figure we keep the proportions Xg : Xg : Xh 
fixed at 1 : 2 : 4, while household sizes are 2 and schools have size 4. We 
note that while the survival probability of the independent partition model 
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is at least as large as the survival probability for the hierarchical model, Rq 
is not ordered in this way. However the infection rates for which Rq crosses 
the threshold 1, are lower for the independent partition model, as it should 
be, since the survival probability (for which there is an ordering) is strictly 
positive if and only if iio > 1- 
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Figure 1: The basic reproduction number Rq (a) and the survival probability 
p (b) as a function of the global infection rate \g for a model in which the 
proportions of the rates are Ag : As : A/i = 1 : 2 : 4 and all households have 
size 2 and all schools have size 4. The hierarchical model is represented by 
the solid lines, while the independent partition model is represented by the 
dashed lines. 
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